Retroviral promoters in the human genome
نویسندگان
چکیده
MOTIVATION Endogenous retrovirus (ERV) elements have been shown to contribute promoter sequences that can initiate transcription of adjacent human genes. However, the extent to which retroviral sequences initiate transcription within the human genome is currently unknown. We analyzed genome sequence and high-throughput expression data to systematically evaluate the presence of retroviral promoters in the human genome. RESULTS We report the existence of 51,197 ERV-derived promoter sequences that initiate transcription within the human genome, including 1743 cases where transcription is initiated from ERV sequences that are located in gene proximal promoter or 5' untranslated regions (UTRs). A total of 114 of the ERV-derived transcription start sites can be demonstrated to drive transcription of 97 human genes, producing chimeric transcripts that are initiated within ERV long terminal repeat (LTR) sequences and read-through into known gene sequences. ERV promoters drive tissue-specific and lineage-specific patterns of gene expression and contribute to expression divergence between paralogs. These data illustrate the potential of retroviral sequences to regulate human transcription on a large scale consistent with a substantial effect of ERVs on the function and evolution of the human genome.
منابع مشابه
Retroviral Transduction of Fluonanobody and the Variable Domain of Camelid Heavy-Chain Antibodies to Chicken Embryonic Cells
Background: Single domain antibodies from camel heavy chain antibodies (VHH or nanobody), are advantages due to higher solubility, stability, high homology with human antibody, lower immunogenicity and low molecular weight. These criteria make them candidates for production of engineered antibody fragments particularly in transgenic animals. Objective: To study the development of transgenic ch...
متن کاملHigh-definition mapping of retroviral integration sites identifies active regulatory elements in human multipotent hematopoietic progenitors.
Integration of retroviral vectors in the human genome follows nonrandom patterns that favor insertional deregulation of gene expression and increase the risk of their use in clinical gene therapy. The molecular basis of retroviral target site selection is still poorly understood. We used deep sequencing technology to build genomewide, high-definition maps of > 60 000 integration sites of Molone...
متن کاملIdentification of promoter regions in the human genome by using a retroviral plasmid library-based functional reporter gene assay.
Attempts to identify regulatory sequences in the human genome have involved experimental and computational methods such as cross-species sequence comparisons and the detection of transcription factor binding-site motifs in coexpressed genes. Although these strategies provide information on which genomic regions are likely to be involved in gene regulation, they do not give information on their ...
متن کاملEctopic Expression of Embryo/Cancer Sequence A (ECSA) in KYSE-30 Cell Line Using Retroviral System
Background Human preimplantation embryonic cells share many similarities with cancer cells such as ability to self-renew, unlimited proliferation and maintenance of the undifferentiated state. Embryo-cancer sequence A (ECSA), also known as developmental pluripotency associated-2 (DPPA2), is a cancer testis antigen (CTA) with unclear biological function yet. Objective: CTAs are expressed normal...
متن کاملTranscription Factor Binding Sites Are Genetic Determinants of Retroviral Integration in the Human Genome
Gamma-retroviruses and lentiviruses integrate non-randomly in mammalian genomes, with specific preferences for active chromatin, promoters and regulatory regions. Gene transfer vectors derived from gamma-retroviruses target at high frequency genes involved in the control of growth, development and differentiation of the target cell, and may induce insertional tumors or pre-neoplastic clonal exp...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 24 14 شماره
صفحات -
تاریخ انتشار 2008